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ABSTRACT 



We report results from a study of the HI content and stellar properties of 
nearby galaxies detected by the Arecibo Legacy Fast ALFA blind 21-cm line 
survey and the Sloan Digital Sky Survey. We consider two declination strips 
covering a total area of 9 hr x 16 deg in the general direction of the Virgo 
Cluster. The present analysis focuses on gas-rich galaxies expected to show little 
or no evidence of interaction with their surroundings. We seek to assemble a 
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control sample suitable for providing absolute measures of the HI content of 
gaseous objects, as well as to study the relationship between HI emission and 
widely-used optical measures of morphology. From a database which includes 
more than 15,000 HI detections, we have assembled three samples that could 
provide adequate HI standards. The most reliable results are obtained with a 
sample of 5647 sources found in low density environments, as defined by a nearest 
neighbor approach. The other two samples contain several hundred relatively 
isolated galaxies each, as determined from standard isolation algorithms based 
either on a combination of spectroscopic and photometric information or solely 
on photometric data. We find that isolated objects are not particularly gas- 
rich compared to their low-density-environment counterparts, while they suffer 
from selection bias and span a smaller dynamic range. All this makes them less 
suitable for defining a reference for HI content. We have explored the optical 
morphology of gaseous galaxies in quiet environments finding that, within the 
volume surveyed, the vast majority of them display unequivocal late-type galaxy 
features. In contrast, bona fide gas-rich early-type systems account only for a 
negligible fraction of the 21-cm detections. We argue that HI emission provides 
the most reliable way to determine the morphological population to which a 
galaxy belongs. We have also observed that the color distribution of flux-limited 
samples of optically-selected field HI emitters does not vary significantly with 
increasing distance, while that of non-detections becomes notably redder. This 
result suggests that the colors and H I masses of gas-rich galaxies cannot be very 
closely related. 



Subject headings: PACS: 98.52. Cf, 98.52.Nr, 98.62.Lv, 98.62.Py, 98.62. Qz, 98.62.Ve 



1. INTRODUCTION 



One of the defining characteristics of recent extragalactic research is the execution of 
wide-area galaxy surveys, which have in many ways revolutionized our understanding of 
both large-scale structure and galaxy evolution. Most of them have been conducted in the 
optical/near- infrared regime, where stellar light dominates. Multicolor imaging surveys of 
virtually the entire sky, some of which c ontain over a rnillion extended sources, notably 
the Two Micron All Sky Survey (2MASS; ISkrutskie et al.l 120061 ) . have been complemented 
by spectroscopic surveys, such as the fifth edition of the Sloan Digital Sky Survey (SDSS) 
Quasar Catalog ( ISchneider et al.ll2010l ). based upon the SDSS Seventh Data Release (DR7), 



which ranges in redshift from the local universe to near the reionization epoch {z < 6) when 
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the first galaxies may have appeared. 



The improvement in technological capabilities over the past decade has enabled the 
astronomical community to build large homogeneous samples of extragalactic objects in other 
regions of the electromagnetic spectrum as well — the radio wavelength regime being one of 
the most prolific in this respect — , allowing us to develop a more thorough account of the 
extragalactic realm. While early blind neutral hydrogen (H I) surveys , such as the Arecibo H I 
Strip Survey (IZwaan et al.lll997l ) or the Arecibo Dual-Beam Survey (IRosenberg &: Schneider 
2OO2I ) . yielded useful estimates of the HI mass function from shallow samples containing a 
relatively modest number of detections, the advent of focal plane arrays at 21-cm enabled 
to carry out wide sohd angle surveys that led to a significant increase in the depth and, 
especially, size of the datab ases. Between 1997 and 2001, the HI Parkes All-Sky Survey 
(HIPASS; iMeyer et al.l 120041 ) used a thirteen-beam array to sample the entire southern sky 
and the northern up to Decl. < +25°, with a beam of 15.5 arcmin and a sensitivity of 17 
mJy per beam at a spectral resolution of 18 km s~^. With a velocity range of —1280 to 
12, 700 km s~^ and more than 4000 HI sources identified in the southern sky alone, HIPASS 
was the first large untargeted H I survey that approached the dimensions necessary for a fair 
sampling of the local volume. 

Currently, a seven-beam array named ALFA (Arecibo L-band Feed Array), installed at 
the Arecibo telesco pe in 2004, is being u sed to carry out the Arecibo Legacy Fast ALFA 
Survey (ALFALFA; iGiovanelli et al.ll2005l ). Its aim is to map, in the HI line, 7000 square 
degrees of the high galactic latitude sky between 0° and +36° in declination. ALFALFA offers 
significantly higher angular (FWHM ~ 3^5) and spectral (~ 5.5 km s~^) resolution, as well as 
higher sensitivity (cTrms = 2.2 mJy/beam at 10 km s~^ resolution) than any previous survey 
of its kind. It is also deeper, with a radial extent (—2000 < cz^ici ^ 18,000 km s~^) and 
a median depth (czhei — 8000 km s~^) that should be sufficient to sample satisfactorily the 
largest structures seen in numerical simulations in a ACDM world model (~ 100 Mpc). 
While HIPASS detected 1 galaxy every ~ 5 square degrees, ALFALFA is detecting ~ 5 
galaxies deg~^. The current detection rate suggests that ALFALFA will have detected over 
30,000 extragalactic HI line sources when completed. An account of the ALFALFA survey 
goals and technical characteristics, as well as its main contributions to science, can be found 
on the website http : / /egg . astro . Cornell . edu/alf alf a/ and references therein. 

This is the first of two papers in which we deal wi th a compilation of the new ALFALFA 
data, partially released already in severa l insta l lments jGiovanelli et al. 2007; Saintonge et al. 



2008 



Kent et al. 2008; Stierwalt et al. 2009; Martin et al. 



20091 ). The data are from two 



separate sky strips of the northern Galactic hemisphere covering a total area of 9 hr in 
right ascension and 16 deg in declination. This portion of the survey is especially inter- 



-4- 



esting because it overlaps almost entirely with the SDSS footprint, thus offering the pos- 
sibility of cross-correlating observations of the cold-gas component with multi-wavelength 
optical measurements. This synergy between t he surveys will be exploited in this paper 
and the accompanying one (IToribio et al.l |2010| . hereafter Paper II) in order to carry out 
a systematic analysis of the fundamental properties of nearby gaseous galaxies and their 
interrelations. In this way we hope both to gain a more complete understanding of the 
structure of the baryonic component in these systems and to set up unbiased benchmarks 
for the normalcy of their HI content that rely on integral attributes that are easily accessi- 
ble to observation. While this is not the first attempt to conduct this sort of research (e.g . 



Havnes fc Giovanellilll984 : 



Garcia-Appadoo et al. 



20091 



Roberts fc Hayneslll994j : iGavazzi et al.lll996t ISolanes et al.lll996 
1), both the significant increase in the sample size and the high 



degrees of homogeneity and completeness of the data will allow us to extend the results of 
previous studies by addressing issues that fell outside their scope. 

In this paper, we describe the main statistical properties of the two large catalogs used 
in our research. We also: describe the steps followed to cross-correlate the radio and optical 
sources and to calculate their radial velocities (Section H]); explore different characterizations 
of the environment of ALFALFA detections that may be useful for defining suitable reference 
samples for the HI content of galaxies (Section [3]); and carry out a comparative study of 
the distribution of the optical structure, colors, and morphologies of the objects included 
in the selected datasets (Section H]). In Paper II, we will apply strategies of multivariate 
data analysis to a complete subset of the HI emitters in low density environments defined 
here. Our aim in that paper will be both to explore inter-variable correlations and to 
determine the linear combinations of intrinsic properties that best represent these systems. 
The relationships we derive will define the standards of normalcy for the neutral gas mass 
content of field galaxies, and thus provide a fundamental reference for future research into 
the evolutionary effects of the galactic environment. 

Unless otherwise stated, to calculate intrinsic quantities we adopt a reduced Hubble 
constant h = iyo/(100km s"^ Mpc'^) = 0.7. 



2. MAIN 21-CM AND OPTICAL CATALOGS 

We describe here the primary databases from which we assembled the galaxy samples 
used in the present study and examine their statistical properties. This section also outlines 
the steps followed to add information on the radial distance of the selected galaxies to the 
measurements already available. 
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2.1. Statistical Properties of Catalogs and Cross-identification of Sources 

The main catalog used in this paper contains ALFALFA HI measurements from two 
regions of the high Galactic latitude sky: 07^^30™ < R.A.(J2000.0) < le^'SO'", +4° < 
Decl.(J2000.0) < +16° and +24° < Decl.(J2000.0) < +28°. This gives a total coverage 
of 2160 deg2. 

The ALFALFA HI detection threshold and com pleteness limit depe nd on the source 
signal-to-noise ratio, S/N, given by the expression (cf. Martin et al-lboool ): 



S/N 



1000 Fh I 

Tl/obs ^ ' 
50 rms 



where Fhi is the 21-cm line flux integral in Jy km s~^, which in combination with the 
cosmological distance to the source, d, in Mpc (see Section I2.2p . provides its total 
neutral gas mass in /i"^ Mq units 



Mhi = 2.356 X lO^d'Fnj , (2) 

W§Q^ is the observed velocity width of the source line profile at the 50% level of the two peaks 
corrected for instrumental broadening in km s~^, Wsmo is a smoothing width (either 
for W^q"" < 400 km s~^ or 20 for W^q"" > 400 km s~^), and cJrms is the rms noise figure 
measured in mJy at a resolution of 10 km s~^. ALFALFA detections are coded according to 
a scheme dependent on S/N and availability of a prior, e.g., an optical source that matches in 
angular coordinates or redshift. Code 1 sources are detections with a 5'/A^ > 6.5 and thought 
to be rehable with high confidence level; code 2 sources typically have 4 < S/N < 6.5, but 
the marginal H I detection is corroborated by a match in position and redshift of an optical 
source. By searching through the ALFALFA data for code 1 and code 2 sources, we came 
up with 15,043 extragalactic objects selected purely on their HI content in the sky region 
defined above and over the full survey bandwidth. 

The portion of the ALFALFA survey we use overlaps with other major galaxy surveys 
such as 2MASS and SDSS. The latter — which has scanned about one quarter of the northern 
Galactic cap sky in the bands u, Q, r, i, and z, reaching objects up to a total magnitude 



of 22.2 in r band (jStoughton et al.l 120021 ) — is especially interesting for our purposes, as 
it provides a uniform, very extensive list of photometric and spectroscopic measurements 
suitable for carrying out an exhaustive study of the correlations between the gaseous and 
stellar prope rties of galaxies. Here we make use of SDSS data coming from the full survey 



(SDSS DR7: lAbazajian et al.ll2009l ). which also provides radial velocity measurements for a 
considerable number of its main targets. The spectroscopic observations, which are complete 
to a Petrosian r-band magnitude of 17.77 after correcting for Galactic extinction (but not for 
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internal reddening), offer a redshift coverage that is deep enough to allow the identification of 
the most probable optical counterpart to all ALFALFA detections found out to ~ 260 Mpc 
within the sky region where the two surveys overlap0. A graphical representation of the 
positions of the ALFALFA sources in the overlapping region is provided in Figure [T] by 
means of a wedge diagram in right ascension. The total number of HI detections coded 
1 and 2 in the SDSS overlap region is 11,239 after discarding objects that he closer than 
1.5 deg to M87, where HI sources cannot be reliably detected because of the strong radio 
continuum emission. 

Some of the main statistical properties of this latter data set are displayed in Figure [2j 
In the top left panel, we show the distance distribution of the ALFALFA sources. The most 
important large-scale features are reflected in the various peaks between ~ 80 and 160 Mpc 
(see also Figure [T]). The shape of this histogram follows that produced by the distance 
distribution of SDSS galaxies (black solid line) closely, except for the narrow artificial dip 
seen near ~ 125 Mpc and the broader one between ~ 215 and 230 Mpc. Both of these are 
related to the important loss in sensitivity of the ALFALF A survey due to radio frequency 



interference (RFI) coming from the San Juan airport's radar (IGiovanelli et al.ll2007l ). The HI 
mass distributions for the full data set and from code 1 detections only (hatched histogram) 
are shown in the top right panel of Figure [2j The vertical dotted line illustrates the minimum 
detectable HI mass arising from the lower heliocentric velocity cutoff of 2000 km s~^ adopted 
in Section 13.11 so as not to include the most uncertain values of this quantity. (Note that 
this also leaves many nearby dwarfs that are rich in neutral gas out of the present study.) 
Below, the bottom left panel features the Spanhauer diagram, detailing the HI mass vs. 
CMB distance. The broad gap in detections associated with the region most affected by the 
intrusion of RFI is clearly visible near the right-hand end. Finally, the bottom right panel 
of Figure [2] shows the intrinsic velocity width distribution. As in the top right panel, we also 
depict the distribution arising only from code 1 objects to demonstrate that the inclusion of 
code 2 detections does not bias the results. 

Unlike other H I surveys, the improved angular r esolution of ALFALFA — which leads to 



positional accuracies typically better than 20 arcsec ( IGiovanelli et al.ll2007l ) — usually makes 
the identification of the optical counterpart of a given H I source an unambiguous task. Fur- 
thermore, the fact that, for each HI detection, the ALFALFA catalog provides the centroid of 
the optical galaxy that appears to b e the most plausible co unterpart according to proximity 



in z-space, color, and morphology (IGiovanelli et al.l 120071 ). enabled us to perform a semi- 



^See http://egg.astro.cornell.edu/alfalfa/images/alfalfa_sdss_coverage.jpg for an image 
of the footprint of ALFA LFA on the SDSS, whose geometry we defined using mangle polygons 
iHamilton fc Tegmarkll2004l ) . 
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automated catalog cross-correlation. Specifically, we assumed that the optical counterpart 
of a HI detection provided by the ALFALFA catalog and a spectroscopic SDSS detection 
are the same object if their projected separation on the plane of the sky is not larger than 
the size of the optical source and the difference in the reported velocities is less than 300 
km s~^. We allow for such a relatively large discrepancy between the observed heliocentric 
velocities in order to account for the fact that SDSS spectroscopic measurements on an ex- 
tended object do not always reflect the radial velocity of the object's centroid, so a positive 
match between two galaxies relies essentially on the similarity of their sky coordinates. In 
practice, however, the dispersion in the difference between radial velocities of matched pairs 
is only of ~ 35 km s~^ (see Figure [3]). For the cases in which ALFALFA sources were lacking 
a spectroscopic counterpart, the photometric criterion was applied. Human intervention was 
reduced to only cases of doubtful identification. 

Table [1] lists the numbers (and fractions) of ALFALFA objects with a photometric or a 
spectroscopic counterpart in the SDSS for different bins of radial heliocentric velocity. Only 
~ 2% of the HI sources are not associated with galaxies listed in the SDSS photometric 
catalog, while there is a somewhat larger percentage, ~ 15%, of HI detections without a 
spectroscopic counterpart. Note also how the slight incompleteness of the S DSS DR7 data 



set th at affects mainly the galaxies with the largest apparent brightness (e.g.. IStrauss et al. 



20021 ) leaves its imprint in the form of reduced fractions for the first two velocity bins of 
the third and fourth columns of the table. Also included in Table [1] are the numbers (and 
fractions) of ALFALFA objects detected within the 'SDSS spectroscopic galaxy sample', 
SDSS-spec sample for short. From now on, we will use this shorter designation to refer to 
the subset of SDSS sources with both photometric and spectral morphology classified as of 
galaxy type, and with r < 17.77 mag. About one in every five galaxies in this subset is also an 
ALFALFA detection, although the fraction rises to one in two for czhei < 7500 km s~^. The 
small value quoted in the higher velocity entry of the last column of Tabled] also reflects the 
impact of the substantial incompleteness of the ALFALFA data due to RFI above ~ 15, 000 
km s~^. The effects on the completeness of a few other narrower RFI features present in the 
spectral bandpass are comparatively modest. 

Figure H] shows how the galaxies in both catalogs are distributed in a global color-light 
concentration parameter space. The plot shows {u—r) color versus inverse light concentration 
index, defined as the ratio of the radii containing 50% and 90% of the Petrosian galaxy light, 
C59 = i?50/-R90, which we calculate in the r-band. The density contours delineate the 
behavior of the galaxies within the SDSS-spec sample, while the data points correspond 
to those objects that are also ALFALFA members. It is clear from this figure that SDSS 
galaxies show a clear bimodality and separate into two main classes with comp arable numbers 



of objects that can be addressed as the blue- and red-sequence galaxies (e.g. iBlanton et al. 
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20031 ). In contrast, HI detections mostly populate the locus corresponding to the bluer, 
late-t ype, star-forming; system s. This trend has been known about for a very long time 



(e.g^ 



Roberts fc Havneslll994f) and it is in very good agreement with the similar offset found 



by West et al. ( 20091) using 195 Hl-selected galaxies from the Parkes Equatorial Survey 
( iGarcia-Appadoo et al.ll2009l ). We will show in Section l^?T] that, while the bimodality of the 
optically-selected sources is reinforced when looking at the global trend with increasing radial 
velocity, and hence with increasing typical luminosity of the galaxies, the H I detections are 
distributed in a similar way independently of their distance from the observer. 



2.2. Velocity and Radial Distance of Sources Referred to the CMB Frame 



To calculate radial velocities and distances referred to the Cosmic Microwave Back- 
ground (CMB) standard of rest for the galaxies in both the ALFALFA and SDSS spectro- 
scopic data sets, we first correct the heliocentric velocities listed in these two catalogs for 
peculiar motions. This is d one by adoptin g the multiattractor model of the local peculiar 
velocity field determined bv lMastersI (120051) from the best fit to data from the new /-band 
TuUy-Fisher catalog SFI++ ( ISpringob et al.l 120071 ). This model includes the spherical infall 
into Virgo and the Hydra- Cent aurus supercluster, a quadrupole correction to account for the 
smaller than average expansion out of the supergalactic plane, as well as a residual dipole 
pointing towards the Hydra-Centaurus/Shapley superclusters. The predicted peculiar mo- 
tion corrections, typically of the order of 200 km s~^, are non-negligible for objects within 
6000 km s~^. Beyond that radial velocity we assume the Hubble law is exact. The peculiar 
velocities are tapered to zero between 5000 and 6000 km s~^ in order to provide a smooth 
transition between these two regimes. For transform ations between heli ocentric and CMB 
frames of reference, we apply the corrections quoted in lKogut et al.l (119931 ). For conversion to 
a CMB luminosity distance, we assume Euclidean properties of space, an approximation that 
is maintained for the calculation of any intrinsic galactic property involved in the present 
study. 

To minimize the impact of redshift-space distortions in the assignment of distances (and, 
consequently, in local density estimates; see Section 13. ip to galaxies lying in the vicinity of 
the groups and clusters present in the cosmic volume targeted by ALFALFA, we adopt a 
strategy that can easily be applied in a systematic way to any virialized aggregation of 
galaxies independently of its scale. It consists of estimating the virial radius of the system, 
Tvir, and its velocity dispersion along the line-of-sight, a, and then using this information 
to assign the corresponding system distance to all galaxies with projected distances to the 
center of one of these overdense environments of less than 1.5rvir and systemic velocities 
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with respect to the system mean velocity less than 3 a. To calculate rvir in Mpc, we use the 
following approximation to the halo density-radius relation based on a singular isothermal 
sphere with mean interior density 200 times the current value of the critical density of the 
universe (cf. ICarlberg. Yee. &: EUingsoru 119971 ): 



cr 



700 



(3) 



Values for a, in km s~ \ mostly come from the compilation of Abell cluster properties by 
Struble &: RoodI (Il99l[ ). while for the few galaxy aggregations whose veloci ty dispersion is 
not qu oted in that catalog, we use the r nembership assign ments carried out by lSpringob et al. 



( 120071 ) in the standard estimator (e.g. jTago et al.l 120081 ) 



a 



[l + z 



i=l 



[CZi 



cz) 



(4) 



where the sum is intended over the Ny members of the aggregation with measured velocity 
CZi, and z is the mean system redshift. 



3. HI CONTROL SAMPLES 

Because the cold gas component of galaxies is both a good indicator of their potential for 
star formation and an excellent probe of the physical conditions of the intergalactic medium, 
the quantification of the HI content in galaxies is a useful diagnostic tool for studying the 
evolutionary effects of their surroundings. 

The definition of a benchmark suitable for absolute measures of H I content necessitates 
a control sample as bias-free as possible formed by galaxies expected to show little or no 
evidence of environmental interactions. In this section, we study different characterizations 
of the local environment of ALFALFA detections, define three possible control samples (see 
Table [2]), and explore their relative degree of adequacy. 



3.1. Local Number Density Measurements for ALFALFA and SDSS Galaxies 

The environment of a galaxy can be defined in terms of the density of galaxies located in 
its immediate vicinity. To calculate the underlying three-dimensional galaxy number density 
at a given position, we use the galaxies in the SDSS-spec data set as the nodes of a nth- 
nearest neighbor network. This calculation is carried out for both ALFALFA and SDSS 
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sources, excluding galaxies lying in those regions in which the results are affected by large 
errors or where detections are very scarce and/or unreliable. In particular, we impose a lower 
heliocentric velocity limit of 2000 km s~^ to avoid those galaxies likely having the largest 
relative uncertainty in distance arising from peculiar motions not duly accounted for — a 
constraint that also removes the closest cosmic volume where the SDSS pipeline photometry 
of large, extended galaxies is most inaccurate — , and a velocity cutoff ranging from 15,000 
km s~^ up to 16,000 km s~^ that embraces the deep trough in sensitivity of the ALFALFA's 
spectral window due to RFI (see Figures [T] and [2]) • 



We follow ICasertano fc HutI ( 119851 ) and compute a local, continuous measure of envi- 
ronment from the unbiased estimator 



P6 



15 



(5) 



where is the three-dimensional (comoving) distance to the 6th-nearest neighbor in the 
SDSS-spec sample in Mpc. In an effort to correct for redshift-space distortion^ and 
improve the accuracy of our density measures in overdense environments, pe is calculated, 
for those galaxies assigned to a cluster (Section 12.21) . using instead projected separations 
scaled by a factor o f (3/2)^/^ to convert them into a three-dimensional measure of distance 
JCooper et al.ll2005h . 



Local densities calculated this way are then corrected for the magnitude cutoff of the 
SDSS catalog by multiplying them by the inverse of the selection function 



A^iim(2)+AM(«,<5) 



ip{a, 6, z) 



J — oo 



0(M)dM 



I^t' <P{M)dM 



(6) 



which we cal culate by integrating the differential luminosity function, 0(M), of the SDSS 
provided by iBlanton et al.l (120011 ). In equation ([6]), Mn^^z) and Mm = Mh^^zm) are, 
respectively, the faintest magnitudes detectable because of the optical flux limitation at the 
galaxy redshift and at a fiducial redshift, zm, which we match up with that corresponding 
to the low velocity cutoff of 2000 km s~^ imposed to the samples, while AM{a, 6) is the 
extincti on correction as a function of right ascension (a) and declination (6), for which we 
use the ISchlegel. Finkbeiner. fc David (119981 ) values loaded in the SDSS database. 



It is worth noting, however, that this measure of the local density cannot be expanded 
to the full ALFALFA survey, which is also carrying observations within the solid angle 
22*^ < R.A.(J2000.0) < 03^ and 0° < Decl.(J2000.0) < +36°, not scanned by the SDSS. In 



^We do not correct for survey edge effects given the large size of the catalogs. 
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this case, the 2MASS (ISkrutskie et al.ll2006l ). with its full sky coverage (98% for the extended 
source catalog), offers the most sensible alternative to characterizing the environment in 
which ALFALFA detections reside. Thus, in anticipation of future research, we also assigned 
a surface density parameter /ig to each optical and gaseous spectroscopic source from the 
formula ^ 

/i6 = — ' C^) 



TTS 



6 



with sq the projected distance to the 6th-nearest neighbor, in degrees, derived taking into 
account the underlying 2MASS galaxy distribution on the sky in the Kg band down to 
13.0 mag, where the 2MASS exten ded source catalog has been estimated to be at least 
95% complete for most of the sky (jjarrett et al.l |2000| ). We do not correct this surface 
density estimator for Galactic absorption, because at this extreme near-infrared wavelength 
the opacity is insignificant for \b^^\ ^ 10°. It should not be forgotten, however, that this 
immunity to the effects of the Milky Way's dust arising from 2MASS being a survey in the 
near-infrared, biases its sensitivity regarding the morphology of the detected galaxies toward 
early-type spirals and ellipticals. 

In Figure [5l we compare the values of pe (uncorrected for redshift-space distortions) and 
/ig for galaxies in the SDSS-spec (top) and ALFALFA (bottom) data sets. The plots show 
that these two local density tracers follow a weak, non-linear relationship, with the three- 
dimensional measure of environment spanning a somewhat larger dynamic range. Note 
the relatively few observations associated with clusters in the bottom plot caused by the 
undersampling of the highest density environments by ALFALFA (see also next section). 



3.2. The Low Density Environment HI Galaxy Sample 



One possibility to define a sample of galaxies whose evolution is hypothetically less 
influenced by external, disruptive processes is to select those objects least affected by the 
clustering phenomenon. According to observational findings the influence of the cluster en- 
viron ment on the proper t ies of galaxies can ex tend up to about twice the cluster virial radius 



(e.g., iBalogh et al.lll998l : ISolanes et al.l 1200 ll ). Therefore, any density threshold adopted to 
extract a control sample of unperturbed galaxies from a given catalog should be low enough 
to leave out of it, not only bona fide cluster and group members, but also those objects 
located in adjacent regions. 

We studied the relationship between the neutral gas content of ALFALFA galaxies and 
their local three-dimensional number density measured in the previous section to see if we 
can identify the density threshold above which environmental interactions start to become 
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important. Following the works by lHaynes fc Giovanellil ( 119841 ) and lSolanes et al.l (Il996l ). the 
former property has been quantified by means of a Hl-deficiency parameter, DEFhi, that 
compares (the decimal logarithm of) the observed HI mass, M^^j^, in solar units, with the 
value expected from a galaxy free of external infiuences of the same observed morphological 
type, T"^^, and optical linear diameter, D"^^, expressed in kpc. Specifically: 

DEFhi = (logMH:(T°^^D°"^)) - logM^^^ , (8) 

so positive values of DEFhi indicate HI deficiency. This test has been restricted to those 
ALFALFA detections with Hubble types and (apparent) blue visual sizes available on the 
Arecibo General Catalog (AGO), a private galaxy database maintained by MPH and RG. In 
order to increase sample size and reduce statistical errors, we inferred expectation values for 
the HI mass for 2615 disk galaxies with Sa-Sd m orphologies using the maximum likelihood 



linear regressions of log Mh i on log D given in ISolanes et al.l (119961 ) . Since in that work 



the standards of normalcy for the H I content were defined specifically for galaxies of type 
Sa-Sc, we thus assigned Scd-Sd types to be the same as Sc. 

We present in Figure El a plot of the median DEFhi in bins of local number density. We 
show results inferred using sources with radial velocities between 2000 and 18,000 km s~^, 
as well as separately for three sub-intervals of this velocity range. Looking at the different 
panels one can see moderate oscillations in the central values of DEFhi. Nonetheless, these 
are always well within the quoted uncertainties and therefore not statistically significant, 
hinting at best a mild increase of gas deficiency with density. The weak dependence of the 
gas content of ALFALFA sources with the local density is the consequence of limiting the 
effective integration time of the survey to about 48 seconds per beam area. While ALFALFA 
has been designed to detect from massive spirals at 2; ~ 0.06 (~ 250 Mpc) to dwarfs with H I 
masses as low as Mhi ~ 10^ at the Virgo distance (e.g., Idi Serego Alighieri et al.ll2007l ). 
the highly gas-deficient galaxies that are expected to reside in the densest regions of the 
surveyed volume should escape detection unless they are located nearby (see also Figure |5]). 
It is also important to stress that the observed behavior of the gas content with local density 
is not affected by the negative offset shown by the inferred median DEFhi values. Such a 
bias obeys to several factors. On one hand, as mentioned above, we are underestimating 
the expected gaseous content of Scd-Sd galaxy types, which lowers the median DEFhi of 
our dataset about 0.15 units. Moreover, it is the fact that ALFALFA is detecting total 
21-cm fluxes for individual galaxies that, on average, are around 10% larger than their AGO 
counterparts. This reduces the average value of DEFhi by additional 0.05 units. The rest 
of the drop in the HI deficiency ca n be attributed to differences in the object selection 
with respect to ISolanes et al.l ( 119961 ). Note that the ALFALFA survey extends further in 
redshift than the sample used in this former work, which was centered on the Pisces- Perseus 
Supercluster. This allows us to deal with a significantly large number of high-Hl-mass 
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objects. Beside s, due to a les s robust selection of the control sample, we cannot discard the 



presence in the iSolanes et aU 's data of a non-negligible fraction of galaxies with a disturbed 
H I mass, which might have contributed to further reduce the standard values of this quantity 
calculated in that paper. 

The lack of a clear HI content-density correlation for the portion of the ALFALFA 
spring-sky survey under study notwithstanding, we prefer to be conservative and, as indi- 
cated at the beginning of the section, define a control sample that does not include galaxies 
located in the vicinity of an overdensity. After testing different density thresholds, we find 
that the condition pQ < 0.5 galaxies Mpc~^ discards all SDSS galaxies with spectroscopic 
information cataloged as cluster members in the surveyed region — the corresponding thresh- 
old based on the 2MASS galaxy distribution would be fiQ ~ 10 galaxies deg~^ — , as well as 
more than 91 (96) per cent of these objects with system-centric distances between 1.5-3 rvir 
(1.5-2 Tvir) and systemic velocities within 1 a of the cluster mean, which may be considered 
representative of the system outskirts population. This threshold performs similarly well 
applied to ALFALFA detections. The Low Density Environment (hereafter LDE) H I galaxy 
sample that results from selecting those ALFALFA detections obeying the above density 
constraint has a total of 5647 objects. Wedge diagrams in right ascension of the spatial dis- 
tribution of this data set, as well as the HI detections discarded for lying in higher density 
environments, are shown in Figure [3 



3.3. Isolated Galaxy Samples 

By selecting galaxies in low density regimes one may confidently exclude objects as- 
sociated with clusters and large groups, but cannot guarantee that such galaxies have not 
interacted at some point with nearby companions. That is the reason why we also studied 
the possibility of defining control samples from the ALFALFA catalog that maximize the 
chances of dealing with galaxies whose evolution has not been influenced by other objects 
in the recent past. Here we build two samples of isolated galaxies by applying two different 
kinds of isolation criteria. 

As a first approach, we de vised a select i on te chnique for the SDSS spectroscopic data 



set, similar to the one used by iPrada et al.l ( l2003l ). that combines spectroscopic with pho- 



tometric information to detect galaxies without dynamically relevant companions inside a 
given volume in z-space. Thus, we classify a galaxy i with apparent r-band total magnitude 
Ti at coordinates (a^, 6i, Zi) as a good candidate for a sample of isolated galaxies if it has no 
neighbor j with < r, + 2.5 mag within the cylindrical volume, centered on its position, of 
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radius and semi-axis 



R = Roi) 



-1/3 



and V = VqiP 



-1/3 



0"12 



(9) 



respectively, where i?o = 0.7 Mpc is the hnking- length at the adopted fiducial redshift of 2000 
km s~\ Vo = HqRq, and ip^aijSi, zt) is the selection function defined in equation ([6]) that 
keeps constant the probability of finding a galaxy in a given volume regardless of distance. 
In equation the term ai2 = 350 km s~^ is added in order to account for the distortion on 
the intergalactic distances along the line-of-sight induced by the on e-dimensional pairwise 
velocity dispersion of field galaxies ( IGuzzo et al.l 119971 : iLandyl |2002[ ). The preference of a 
0.7 Mpc linking- length has been dictated from the fact that it would take almost 3 Gyr for a 
galaxy with a peculiar velocity of 175 km s^^ to cover such a distance, while the magnitude 
constraint is set to take into account neighbors that have at least one-tenth of the target 
mass (assuming light is a proxy for mass). This technique identifies 461 SDSS-spec galaxies 
that are also HI detections as members of this first isolated galaxy (hereafter IGl) sample. 

A second isolated galaxy (IG2) sample has been defined by applying the sort of pre- 
scription typically used for photometric samples — usually much deeper than the spectro- 
scopic ones — , which takes into account in a semi-empirical way the dista nce to compan- 
i ons. Perhaps the best known example is the nearest neighbor algorithm by iKarachentseva 
(119731 ) based on the comparison of appa rent angular diame ters between candidate isolated 
ga laxies and their neighbors. Recently, lAUam et al.l (|2005[ ) have implemented a variation 
of iKarachentseval 's technique to detect isolated galaxies in the SDSS DRl that we adopt 
here with a slight modification. Thus, we consider a target galaxy i with a g-hand total 
magnitude gi and g-hand Petrosian radius Ri to be isolated if the projected sky separation 
9ij with respect to any neighboring galaxy j satisfies the conditions 



and 



\9i - 9j\ < 2.5 



(10) 



where the 2.5 magnitude difference corresponds to about a factor 9 in brightness for a 
galaxy with a fiat surface brightness profile. We also took into account the fact outlined by 
AUam et al.l (120051 ) that the application of these selection criteria to SDSS data effectively 
reduces the magnitude range of the candidate isolated galaxies at both ends (see, for instance, 
their Figures 11 and 12). Thus, by considering only candidates with Galactic-extinction- 
corrected g-hand magnitudes in the range 15.5 < gi < 19.5, we extracted a total of 236 
isolated HI emittersj. 



•^Had we followed the same approach, but imposing instead the more restrictive conditions \gi — gj\ < 3.0 
mag with 16.0 < gt < 19.0 mag used in lAllam et al-l 's paper, the number of retrieved isolated HI galaxies 
would had drop to 67 within the region surveyed by ALFALFA. 
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Given that the spectro-photometric algorithm imposes the restriction < 15.27 on the 
magnitude of IGl members, there is only a small overlap in the range of apparent brightnesses 
of the galaxies that are included in both subsets of isolated objects. In practice, this means 
that only 7 ALFALFA detections end up obeying simultaneously the two isolation criteria. 
If we eliminate from the photometric prescription the constraint gi > 15.5 mag to make the 
overlap of both subsets of isolated galaxies easier, then the number of IG2 members rises to 
470. Even so, this enlarged sample includes only 83 of the 461 isolated galaxies selected by 
the spectro-photometric algorithm. Therefore, isolated galaxies in 2;-space are not necessarily 
considered so when their environment is examined in projection and vice versa. 

As done in the previous section for the LDE HI galaxy sample, we also studied the 
environment of the ALFALFA galaxies in the IGl and IG2 subsets. As an example, we 
display in Figure [H] a graphical representation of the H I-deficiency versus local density for 
IGl galaxies. As before, we focus only on sources having optical diameters and spiral types 
from Sa to Sd listed on the AGC. As shown in this figure, while isolated galaxies obtained 
through automated systematic searches avoid the densest regions of the universe, a non- 
negligible fraction of them inhabit regions of moderate local galaxy density having values of 
Pe larger than the upper density threshold adopted to define our LDE HI galaxy sample. 
This demonstrates that the definitions of isolation and low density environment are not 
necessarily equivalent; local number density measurements associated w ith isolated galaxie s 



can span quite a broad range of values (for a similar conclusion, see IVerley et al.l 120071 ). 
Alongside this fact. Figure [S] also shows that although the different approaches adopted to 
identify field galaxies assemble samples of significantly different sizes, the corresponding gas 
content distributions are quite similar. There is therefore no evidence that our searches of 
isolated galaxies selected objects that are particularly gas-rich compared to those in the LDE 
galaxy sample. 



4. UNDERLYING PROPERTIES OF GALAXIES IN THE HI CONTROL 

SAMPLES 

Here we use several independent classification schemes based on the photometric data 
for the galaxies in the three candidate control samples defined in the previous section to 
provide more details and increase our understanding of the objects that inhabit quiet envi- 
ronments. These measures are all sensitive to the morphology and they are objective, easily 
reproducible, and applicable to data sets in which the morphological classification by direct 
inspection of the galaxy images is impractical, given the large number of objects involved. 
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4.1. Structure, Colors, and Morphology of the LDE Galaxies 



Since we are focusing our attention on subsets of non-clustered , gas-r ich galaxi es, pre - 
dictably of late type, we first apply the criterion used by lMaller et al.l (120091 : hereafter iMalOOl ) 
to select disk galaxies from the SDSS. As stated by those authors, almost pure disk samples 
can be obtained by selecting galaxies that obey at least one of these requirements: Sersic in- 
dex n < 3 or observed axial ratio b/a < 0.55, where the parame ter n — which in our case has 
been extracted from the NYU Value-Added Galaxy Catalog (IBlanton et al.l l2005l ) — mea- 



sures the shape of the observed r-band luminosity profile of a galaxy fitted using the Sersic 
j^i/n formula with elliptical isophotes. We note that these constraints include SO galaxies in 
the 'disk' class, because the presence of a disk makes their measured axial ratio more impor- 
tant in determining their appare nt inclination than the intrinsic ellipticity of the spheroidal 
component. According to lMalOQi . the completeness of their classification scheme reaches 70% 
for disk galaxies, with a reliability of ~ 95% when applied to SDSS data. The completeness 
is the fraction of galaxies of a given type that are successfully selected from the original 
sample by the classification scheme. The reliability is the fraction of galaxies of the desired 
type in the selected subset. 

In good agreement with expectations. Table [3] shows that the fraction of members 
of th e SPSS -spec sample in low density environments (pe ^ 0.5 galaxies Mpc~^) that 
obey lMal09l 's criterion is very high (~ 85% on average), and even higher for the subset of 
ALFALFA detections (~ 92%). Our global percentages of late-type objects are, however, 
slightly lower than those obtained by lMal09l . This result can be attributed to the fact that 
they were inferred for a very shallow data set with < —20.6 mag (note that the agree- 
ment between our results and lMal09l 's becomes almost perfect if we restrict the comparison 
to our lowest velocity bin). In principle, the decrement in the fraction of galaxies assigned 
to the disk class with increasing radial velocity illustrated in Table [3] can be interpreted as 
an indication of the fact that most lu minous galaxies tend to be of early type, in qualitative 



accordance with previous work (e.g., IStrateva et al.ll200ll : iBlanton et al.ll2003l : iBaldry et al. 
20041 ). Yet we find that in the highest velocity bin ~ 10% of the ALFALFA galaxies are not 



classified as disks, while at these relatively large distances H I detections should arise almost 
exclusively from late-type objects: at 11,000 km s~^ the minimum HI mass detectable in 
the survey is log(MHi,min) ~ 9.3 M©. Accordingly, our r esults s eemingly highlight the diffi- 



culties in morphology assignment that selection rules like lMal09l 's. based on global measures 



of galaxy structure, encounter for faint objects due to the inherent noisiness of the images. 
The inverse light concentration index, C59 = -R 50/-R90, is another measure of galaxy 



structure that can be used as a morphological divider (IShimasaku et al.ll200ll : IStrateva et al. 



2OOII ). As with the Sersic index, this parameter is expected to be correlated with galactic 
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morphology. Indeed, the two indexes can be interchanged for an ideal galaxy with a Sersic 
profile in the absence of seeing. Early-type galaxies have a dominant bulge component, so 
their light is concentrated, while late-type galaxies, which harbour bulges of varying size, have 
their light more dispersed due to the presence of an extended exponential disk component 
and hence larger C59 values. In practice, however, the concentration index is a morphological 
separator that is useful for broadly dividing galaxies i nto two populations: early-type (E, 
SO, Sa) and late-type (Sb, Sc, Irr) objects. We follow IStrateva et al.l (I2OOII ) and adopt an 
C59,T- = 0.38 separator in the r-band, which provides a cut giving equally complete subsamples 
of both morphological populations, while at the same time it reduces the contamination of 
the late-type galaxy class by early-type objects to ~ 12%. When applied to galaxies in 
LDEs, we find that most galaxies have C^g^r > 0.38, indicating the dominance of disks. As 
was to be expected, this tendency is again more marked for ALFALFA detections, about 82% 
of them fall into the light-concentration- index late-type class, than for the optical subset, 
which has a late-type galaxy fraction of ~ 70%. 

Given the correlation between color and morphology, early-type galaxies being generally 
redder than late-types, galaxy colors can also be used as a morphological classification tool, 
as well as to provide a coarse separation of galaxies in terms of their star formation activity. 



We adopt t he classification schem e of iBaldry et al. fl2004f ). which uses the SDSS model 
magnitudes (jStoughton et al.ll2002[ ) to measure color from the difference between the fluxes 
in the u and r filters and to calculate the absolute magnitude in the r band, M^, for which 
we adopt the CMB distance inferred in Section 12.21 As well as bein g an optimal color 
for separating galaxies into red an d blue classes (IStrateva et al.ll200ll ). {u — r) is a good 
indicator of star formation activity ( Deng fc Zou 20091 ). while the r filter contains the peak 
of the light curve for most SDSS galaxies. In this analysis, we do not apply fC-corrections 
since the samples studied are shallow. 

Figure [ H] shows the co lor-magnitude diagram for ALFALFA detections in the LDE sam- 
ple with the iBaldry et al.l delimiter. 



[u — r] 



2.06 - 0.244 tanh 



Mr + 20.07 



:ii^ 



superposed. Not surprisingly, we see quite a few more sources on the blue, active population 
side of the divider (below the curve) than on the red, passive side: 83% and 17% of the total, 
respectively. Moreover, we note that most of the H I galaxies above the separator reside in an 
area of the color-magnitude diagram around Mr 5, —20 mag and (u — r) > 2.3 mag where the 
blue and red galaxy populations identified by IBaldry et al.l (120041 ) overlap in this space (see 
their Figure 9). Therefore, it is not unlikely that many of these galaxies are actually blue 
interlopers. Indeed, if we repeat the same exercise after correcting the SDSS magnitudes 
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and co lors to face-on values, following respectively IShao et al.l (120071 ) and [Masters et al. 
( l2010a| ). and keeping the same separator, the fraction of HI detections assigned to the blue 
group increases to 92%. This highlights the importance of defining morphological separators 
corrected for internal extinction, since the effects of dust on the colors and concentration of 
late-type galaxies can be significant for edge-on systems. 

A slightly coarser color division, but one that facilitates the analysis of the data, is 
provided by the luminosity- indepe ndent (u — r) c o lor cu t of 2.22 mag — also uncorrected for 
internal extinction — suggested by lStrateva et al.l ( 1200 ll ). In the four panels of Figure [TDl we 



show the observed color distributions of the SDSS-spec galaxies in LDEs separated into HI- 
detections and non-detections for different bins of radial heliocentric velocity. As illustrated 
by the histograms, the global fraction of SDSS galaxies in LDEs that are of red-/early- 
type becomes notably higher with increasing distance, in agreement with the well-known 
color-luminosity-morphology(-star formation activity) correlation (see also Figure [9l) , which 
appears to hold regardless of local galaxy density (e.g.. lBall. Loveday. fc Brunneiil2008l ). Yet 
it is obvious from these plots that the observed behavior arises almost exclusively from the 
contribution of galaxies undete c ted in HI. For these systems, the fraction of red objects, 
according to the IStrateva et al.l ( 1200 ll ) separator, grows steadily from less than 15% in the 
lowest velocity bin, to about 39% in the highest one, where ~ 96% of the SDSS galaxies 
redder than the divider are HI non-detections. In sharp contrast, field HI emitters consist of 
(mostly blue) objects whose color distribution shifts only very slighty redwards with increas- 
ing distance, to the point that the ratio between the blue and red fractions for this subset 
remains nearly constant at ~ 11 ±4% in all redshift bins (~ 5±2% if one uses colors corrected 
for internal extinction) and, hence, approximately independent of the mean H I mass of the 
detections. This result contrasts with previous claims, based on optically-selected sources, 
that blue exponential systems, just as with red concentrat e d ones, become monotonically 
redder with increasing luminosity (e.g. iBlanton et al.l 120031 : iBall et al.l |2008|) and suggests 
that the color and H I mass of gaseous galaxies cannot be very strongly related (see also Sec- 
tion U]2]). In Paper II we will return to this issue, which also has fundamental implications 
for the theory of galaxy formation regarding the nu mber of indeperident physical parameters 
that determin e the properties of individual galaxies (jCavazzi et al. 1996 : Disney et al. 2008 
Gavazzi|[2009l ). 



The Galaxy Zoo 1 (GZl) clean catalog (ILintott et al.ll2010l ). which provides visual mor- 
phological classifications for nearly 900,000 SDSS galaxies, offers the possibility of directly 
inspecting the basic morphology of a highly significant fraction of our HI detections. To 
this end, we focused our attention on the subset of 4599 GZl objects that are members 
of the SDSS D R7 spectrosco pic survey and are also included in our LDE HI galaxy sam- 



ple. Following iLintott et al.l . we were rather strict in assigning GZl morphologies to our 
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ALFALFA objects by requiring that they have a 'spiral' or an 'elhptical' flag in the clean 
catalog, i.e., with more than 80% confidence in category assignation after debiasing cor- 
rections. All other galaxies were classified as of uncertain type — as shown in Figure [TTl 
most of the ALFALFA detections with uncertain GZl morphology are objects with n < 2 
but b/a > 0.4. Comparison with the Hubble types listed in the AGC indicates that the 
reliability of this classification scheme for the spiral class is higher than 90%, with a ^ 3 



contam inat ion from SO/S O a gal axies, in good agreement with estimates by iLintott et al. 
(120081 ) and lBamford et al.l (120091 ). In this manner, we infer that ~ 52% of our HI detections 
can confidently be assigned a spiral (disk with spiral arms) morphology, whereas only ~ 1% 
are galaxies with a high probability of belonging to the elliptical class (E and SO galaxies). 
The fact that all the objects in the latter class have b/a > 0.5 and that m ost of them have 
n > 3 (r-band values) implies that prescriptions like that used by lMal09l are very effective 
at sep arating out objects of early-type morphology. Indeed, we find that only 19 of the 4057 



MalOS disks identified in this subset of GZl members are classified by the Galaxy Zoo as 



bona fide ellipticals. We note in addition that just 78 of the 4599 members of the L DE-GZl 
HI subset have a weighted-merger- vote fraction larger than 40% (jParg et al.ll2010l ). 

We extended this a step further by examining by eye the SDSS images of those LDE 
gaseous galaxies that show optical early-type characteristics according to both their color 
and Sersic index. We concentrated our efforts on objects with extreme inclinations that 
meet the following criteria: {u — r) > 2.22 mag, n > 4.0, and b/a < 0.5 or b /a > 0.85 
(the latter two parameters measured in the r-band). In agreement with lMal09l . we found 
that most of our edge-on, red, concentrated HI sources have a clear disk-like morphology 
suggesting that they owe their early-type classification to large amounts of dust reddening. 
This morphology also appears to be the dominant one among the face-on gas-rich galaxies 
identified with large Sersic indexes and red colors, in which luminous red central regions and 
large bars are plentiful. We speculate that these latter objects represent a stage of spiral 
evolution t hat is not as advanced as that assigned to the population of red spirals recently 
studied by [Masters et al.l (l2010bl ). since ours still have very blue, but faint, outer disks and 
are quite rich in neutral gas (several of them have Mht > 10^°M(7^), in good agreement with 
the findings of the GALEX Arecibo SDSS Survey (IWang et al.ll2010l ). Therefore, we are left 
with only a handful of cases that may correspond to genuine gas-rich early-type galaxies 
(ETGs) in which the cold gas could have a recent external origin. While the scarcity of 
this sort of 'hybrid system' in our LDE sa mple is consistent with the ve ry few large ETGs 
detected in HI in the Virgo cluster region (Idi Serego Alighieri et al.l 120071 ). it seems to be at 
odds with the preliminary res ults of the ATLA S^^ survey of 262 nearby galaxies that lack 
spiral arms in optical images (ISerra et al.ll2010l ). which hint at the possibility that isolated 
ETGs embedded in massive HI disks are relatively abundant. 
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The results discussed above, plus the fact that the automated classification of galaxies 
into two major populations using extensive stellar parameters leads to subdivisions that are 
partially contaminated by members of the other class, and therefore to the standardization 
of the trends, lead us to conclude that H I emission offers the most reliable (and clear) way 
to determine whether a galaxy has genuinely an early-type or late-type morphology. 



4.2. Structure, Colors, and Morphology of the Members of the IGl and IG2 

Subsets 

The exercises reported in the previous section for ALFALFA detections and non-detections 
in LDEs were repeated for the samples of isolated galaxies. 

As illustrated in Figure [121 we find that, regardless of the isolation criterion adopted, the 
population of H I detections is notably bluer than that of non-detections, in good agreement 
with our results for galaxies in LDEs. There are, however, significant differences in the color 
distributions of both samples of isolated galaxies. Thus, while the colors of the IG2 members 
mimic the behavior shown by the LDE galaxies at the median depth of the ALFALFA survey 
(bottom-left panel of Figure [TOj) . the IGl members display nearly unimodal distributions — 
there is a hint of bimodality in the histogram of H I detections — strongly skewed to the red. 
Likewise, we find that IGl ga laxies tend to have both smaller light concentration indexes 



and a lower fraction of lMal09l disks than galaxies in both the IG2 and LDE data sets, as 
would be expected from a population more dominated by early-type objects. Indeed, if we 
restrict our attention to just the ALFALFA galaxies with Hubble types listed in the AGC, 
we find that the latter two samples are considerably richer in Scd and Sd spiral subtypes 
than the IGl subset is. 

These discrepancies, however, do not stem from fundamental differences between the 
properties of the objects belonging to the IGl and IG2 samples, but from the distinct se- 
lection criteria adopted to draw up their member galaxies. As illustrated by the top panels 
of Figure [T31 while the luminosity distribution of the LDE galaxies (as well as that of the 
full ALFALFA catalog) moves clearly towards brighter values with increasing distance, the 
corresponding color distribution, which can be reasonably approximated by a unimodal func- 
tion peaking around the characteristic color of the blue sequence and having a long red tail 
(see also Figure [TUI) . shows a modest variation. This rough distance- invariance is reinforced 
by the lack of a substantial red-sequence component for the HI emitters, which start con- 
tributing significantly to the color distribution of the entire galactic population at somewhat 
larger redshifts than the blue one. On the other hand, the isolation selection techniques 
we used place limits on the apparent brightness that the candidate galaxies can have. This 
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inevitably leads to choosing from among the brightest objects that are within the allowed 
magnitude range and, consequently, also among the reddest. Such a bias is stronger in the 
case of our spectro-photometric IGl sample, which excludes objects fainter than r = 15.27 
mag. As a result, most of the HI sources around the peak of the blue sequence are missing 
from the color distribution of this subset, which only preserves its red tail (middle right panel 
of Figure [T3|l . In contrast, the almost disjoint constraint 15.5 < gi < 19.5 mag adopted for 
the definition of the IG2 sample from the relatively shallow ALFALFA data set has led us 
to select galaxies that are just slightly intrinsically brighter and, therefore, slightly redder 
too, than the average LDE members (bottom panels of F igure [T3l). Note, h owever, that 
the application of the photometric isolation prescription by lAUam et al.l (120051 ) to the much 
deeper SDSS DRl catalog led these authors to obtain an isolated galaxy sample with a 
light concentration index distribution that suggested a fifty-fifty morphological composition 
between early and late galaxy types, closer to what we observe here for the IGl subset. 



5. SUMMARY AND CONCLUSIONS 

We have used measurements in the 21-cm emission line from the ALFALFA blind sur- 
vey in a region of the sky also scanned by the SDSS DR7 in order to study the physical 
properties of gas-rich galaxies expected to show little or no evidence of interaction with their 
surroundings. The goal has been twofold: 1) to improve the understanding of the nature of 
extragalactic HI sources; and 2) to identify an observational sample of gaseous galaxies as 
unperturbed as possible by the environment and therefore suitable to set up standards for 
their neutral gas content. 

Different parametric estimates of the environment of H I sources have been explored, in 
order to establish their suitability in providing a catalog of gaseous systems whose properties 
are minimally affected by external influences. Among the environmental measures tested, 
the local number density estimate based on the distance to the 6th-nearest spectroscopic 
SDSS neighbor of a galaxy has emerged as the most appropriate. By applying this estimator 
to the 11,239 HI detections identified in the overlapping region of the radio and optical 
surveys, we have extracted a subset of 5647 HI sources in low density environments. We 
have also defined two different subsets of isolated galaxies by applying standard isolation 
criteria based either on the combination of spectroscopic and photometric information or 
solely on photometric data. In spite of the marked differences in size and membership 
between the LDE and isolated subsets, their respective H I content distributions have proven 
to be all pretty similar. 

In combination with SDSS data, we have studied the distributions of light concentration 
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indexes, colors, and other proxies of morphology for the members of these three control 
samples. All the evidence analyzed indicate that HI emission is detected essentially in 
objects that are structurally similar to galaxies with late-type optical characteristics. Thus, 
over 9 0% of HI sources residing in a quite environment obey the disk selection rule by 



Mal09l . while some 82 per cent have inverse light concentration indexes representative of this 
class. These fractions could be considered actually a lower limit, as the determination of the 
structural parameters of distant objects is quite imprecise. Likewise, from evidence gathered 
by studying the color distributions, we have found that at least between 85% and 90% of the 
ALFALFA detections outside overdense regions are classified as blue, star-forming galaxies. 
These fractions are even higher when extinction-corrected colors are used. In contrast, the 
structural properties and colors of H I non-detections are typical of a population substantially 
richer in early-type, red objects. The assignment of morphologies to a subset of ALFALFA 
galaxies in LDEs from the GZl catalog corroborates these conclusions, as only a very small 
fraction (~ 1 per cent) of the objects studied have been classified into the elliptical class with 
high reliability. These results suggest that the more reliable automated way to determine 
whether a galaxy has genuinely an early- or late-type morphology is by measuring its HI 
emission. 

The standard view that red galaxies and, consequently, passive objects of type SO/Sa 
or earlier, tend to have less neutral hydrogen than bluer, star-forming late-type systems 
is therefore confirmed. Our research also corroborates the prediction, based on the positive 
tight correlation between global color and intrinsic luminosity inferred from optically-selected 
galaxies, that the fraction of red-/early-type galaxies in our SDSS-spec LDE sample should 
increase with increasing radial velocity of the sources. We observe, however, that while the 
color distribution of the objects undetected in HI clearly becomes redder, that of the gas-rich 
population does not change significantly as a function of redshift. This suggests that the color 
and gaseous mass of field HI emitters are not very strongly interrelated. Likewise, we have 
shown that the close connection between color and luminosity can have an important impact 
on isolation criteria, favoring the selection of the redder /earlier members of a catalog, the 
shallower the survey the most notorious the bias. Awareness of this selection effect, together 
with the relatively modest sizes and reduced dynamic range subtended by the properties of 
the members of our isolated galaxy samples — which are a direct consequence of the severe 
restrictions imposed by the selection algorithms on the range of apparent magnitudes — , leads 
us to conclude that the LDE approach provides the most suitable and statistically sounder 
control sample to set standards of reference for the HI content of galaxies. In Paper II we 
will present a detailed derivation of such standards by applying a non-parametric principal 
component analysis to a flux-limited subset of this latter dataset. 
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Table 1. Distributions of ALFALFA and SDSS Galaxies in Radial Velocity Bins 





ALFA 


ALFA/SDSS-phot 


ALFA/SDSS-spec* 


SDSS-spec 


SDSS-spec/ ALFA 


(km s-i) 




^gal {%) 


Wgal (%) 




^gal (%) 


< 500 


71 


65 (91%) 


50 (77%) 


37 


18 (49%) 


500-4000 


1523 


1417 (93%) 


1024 (72%) 


1702 


910 (53%) 


4000-7500 


3104 


3066 (99%) 


2529 (82%) 


5102 


2420 (47%) 


7500-11,000 


3385 


3353 (99%) 


2947 (88%) 


9196 


2853 (31%) 


11,000-14,500 


2280 


2261 (99%) 


2025 (89%) 


9341 


1977 (21%) 


14,500-18,000 


876 


871 (99%) 


807 (93%) 


10187 


787 ( 8%) 


Totals 


11,239 


11,033 (98%) 


9382 (85%) 


35,565 


8965 (25%) 



Note. — Numbers in different velocity bins of ALFALFA galaxies (ALFA), Hi detections with a photo- 
metric counterpart in SDSS (ALFA/SDSS-phot), and Hi detections with a spectroscopic counterpart in SDSS 
(ALFA/SDSS-spec*), regardless of the morphological classification of their SDSS spectrum, in the region where 
both catalogs overlap. SDSS-spec designates the sample of SDSS sources with both photometric and spectral 
morphology classified as of galaxy type and having r < 17.77 mag, while SDSS-spec/ ALFA identifies the subset 
of SDSS-spec galaxies with an ALFALFA counterpart. Fractions are given in parentheses. 



Table 2. HI Control Samples 



Acronym 


^'gal 


Selection criterion 


LDE 

IGl 

IG2 


5647 
461 
236 


Low local galaxy density environment (pg < 0.5 ft.'' galaxies Mpc"'') 
Isolation inferred from spectro-photometric information (Equation O 
Isolation inferred from photometric information (Equation I10|l 



Table 3. ALFALFA Detections and SDSS Galaxies in LDEs Obeying iMalOQf s Criterion 







SDSS-spec 


SDSS-spec/ ALFA 


(km s~^) 


^'gal 


A^disk (%) 


^gal 


AfdiBk (%) 


2000-7000 


2063 


1893 (91.7%) 


1330 


1257 (94.5%) 


7000-11,000 


3912 


3403 (87.0%) 


1529 


1411 (92.3%) 


11,000-18,000 


7889 


6472 (82.0%) 


1555 


1414 (90.9%) 


Totals 


13864 


11768 (84.9%) 


4414 


4082 (92.5%) 



Note. — Numbers in different velocity bins of SDSS-spec and 
SDSS-spec/ALFA galaxies (the definition of these subsets can be 
foun d in the caption of Table [TJ in low density environments satis- 
fving [MalOgl 's disk-identification criterion (N^ig]^). We only consider 
galaxies with (r-band) Sersic index 0.5 < n < 5.9 and axis ratio 
0.15 < b/a < 1. Fractions are given in parentheses. 
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Fig. 1. — Sky distribution of ALFALFA sources in the sky region where it overlaps with the SDSS 
DR7 spectroscopic survey. 
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Distance [Mpc] log Wso.i [km/s] 

Fig. 2. — Top-left: Distribution of ALFALFA (solid-blue histogram) and SDSS (black line) 
galaxies according to CMB distance (see Section |Z]2|) . Top-right: HI mass distribution for 
ALFALFA detections (solid line). The distribution for the ALFALFA code 1 detections 
is represented by the dashed-blue histogram. The vertical dotted line corresponds to the 
minimum HI mass detected at 2000 km s^^. Bottom-left: Spanhauer diagram for the objects 
in the ALFALFA catalog. Bottom-right: Inclination-corrected velocity width distribution 
(black line) for ALFALFA sources. The dashed-blue histogram depicts the distribution 
arising from code 1 objects only. 
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140 



ALFALFA-SDSS position separation ["] 



2500 - 
2000 - 
1 1 500 - 



1 000 - 




300 



ALFALFA— SDSS velocity difference [l<nn/s] 

Fig. 3. — Top: Distribution of differences between HI and SDSS sky positions for ALFALFA 
detections with a SDSS photometric counterpart. Bottom: Distribution of differences between 
HI and SDSS radial heliocentric velocities for ALFALFA detections with a SDSS spectroscopic 
counterpart. 
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0.5 1.0 1.5 2.0 2.5 3.0 




0.5 1.0 1.5 2.0 2.5 3.0 

(u-r) 

Fig. 4. — Inverse light concentration index in the r band, Csg^.^, versus (u — r) color for members of 
the SDSS spectroscopic sample (black contours), and for those that are also ALFALFA detections 
(blue dots). Contours are spaced at intervals of 40 density units. The histograms on the right and 
top sides of the main panel show the distributions of the corresponding variables for these two data 
sets normalized to peak values. 
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Fig. 5. — Three-dimensional local density of galaxies derived using the distance to the 6th-nearest 
neighbor, pQ (Equation [5]), in the SDSS-spec data set vs. the corresponding projected density 
estimator, fiQ (Equation [7]) , calculated from the 2MASS galaxy distribution in the Ks band down 
to 13.0 mag. We show results for SDSS-spec galaxies (top) and for ALFALFA detections (bottom). 
Contours are spaced at intervals of 50 (top) and 25 (bottom) density units. Galaxies classified 
as cluster members are drawn using big red dots, whereas green dots are used for the clusters' 
outskirts population. In these plots, the values of pe for cluster members have not been corrected 
for redshift-space distortions. 



- 34 - 




1 2 

bg p6 [h^ Mpc"^] 
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Fig. 6. — Medians and inter-quartile ranges of the H I-def i ciency parameter, DEFhi, calculated 
using the standards of H I content given by ISolanes et alJ (jl996l ) , in equal bins of log pg up to 
log pQ[h^Mpc'^] = 1 . Above the latter value, only two larger equal bins are considered. Results 
for bins containing less than 25 objects are represented by thinner lines. The different panels are 
for ALFALFA detections of Sa-Sd type with radial heliocentric velocities between 2000 and 18, 000 
km s"^ (top-left), as well as for the three subintervals: 2000-7000 km s"^ (top-right), 7000-11,000 
km s"^ (bottom-left), and 11,000-18,000 km s"^ (bottom-right). 
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Fig. 7. — Wedge diagrams in right ascension of the LDE sample of ALFALFA galaxies defined in 
the text (top) and of those ALFALFA H I detections lying in environments of higher density in the 
same sky region (bottom). 
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Fig. 8. — Top: DEFhi, calculated using the standards of H I content given by lSolanes et alJ ()l996l ). 
vs. log pq. alfalfa detections of Sa-Sd type selected by a spectro-photometric isolation criterion 
(IGl subset; see text) are shown as big red diamonds. Dots represent all Sa-Sd ALFALFA detections 
spanning the same range of (7-band magnitudes than the isolated objects (15.5 < Qi < 19.5). The 
subset of galaxies to the left of the vertical dashed line (LDE galaxies) is highlighted in blue color. 
Bottom: Corresponding medians and inter-quartile ranges of DEFhi. Results for all the ALFALFA 
galaxies depicted in the main panel are represented by error bars in equal bins of log pg up to 
log pQ[h^Mpc~^] = 1, while above the latter value only two larger equal intervals are considered. 
Results for bins containing less than 25 objects are drawn using thinner lines. The red horizontal 
lines show the median (solid) and upper and lower quartiles (dotted) of DEFhi inferred from all the 
galaxies in the IGl subset. Right: Histograms normalized to peak values showing the distribution 
of DEFhi for the subsets of LDE galaxies (blue) and isolated sources (red). Similar results are 
obtained for the IG2 subset. 
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Fig. 9. 
subset . 



, — (u — r) color versus absolute magnitude in the r-band for ALFALFA galaxi es in the LDE 
The dashed curve represents the red-blue population separator adopted by iBaldrv et al. 
Colors are uncorrected for internal extinction. 
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Fig. 10. — (u — r) color distribution for SDSS galaxies in LDEs for different bins of radial he- 
liocentric velocity. The global histograms are split into HI detections (dashed blue) and non- 
de tections (red line). Th e vertical dashed line represents the red-blue population separator adopted 
bv lstrateva et all (l200ll ). 
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Fig. 11. — Distribution of GZl morphologies for ALFALFA LDE detections in the Sersic index- 
observed axis ratio space (r-band): Spiral (top), Uncertain (middle) and Elliptical (bottom). In 
each pan el, the top-right corner region delimited by dashed lines encompasses those objects not 
verifying |Mal09|'s disks criterion. 



-40 - 




^ 40 




Fig. 12. — (u — r) color distribution for SDSS galaxies obeying the spectro-photometric (top) 
and photometric (bottom) isolation criteria (see text). The global histograms are split into HI 
detections (dashed blue) and non -detections (red). Th e vertical dashed line represents the red-blue 
population separator adopted by IStrateva et al.l (|200ll ). 
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Fig. 13. — Absolute magnitude (left) and {u — r) color (right) vs. CMB distance for the 
three ALFALFA control samples: LDE (top), IGl (middle), and IG2 (bottom). The member 
galaxies are drawn as green dots, whereas the middle and bottom plots also depict all the 
ALFALFA detections as gray dots. The horizontal dashed lines in the {u — r) color plots 
show the peak values corresponding to the red and blue populations (see also Figures [TO] and 



